NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings

نویسندگان

  • Dan Istrate
  • Corinne Fredouille
  • Sylvain Meignier
  • Laurent Besacier
  • Jean-François Bonastre
چکیده

This paper presents different pre-processing techniques, coupled with three speaker diarization systems in the framework of the NIST 2005 Spring Rich Transcription campaign (RT’05S). The pre-processing techniques aim at providing a signal quality index in order to build unique ”virtual” signal obtained from all the microphone recordings available for a meeting. The unique ”virtual” signal relies on a weighted sum of the different microphones while the signal quality index is given according to a signal to noise ratio. Two methods are used in this paper to compute the instantaneous signal to noise ratio: speech activity detection based approach and a noise spectrum estimate. The speaker diarization task is performed using systems developed by different labs: the LIA, LIUM and CLIPS. Among the different system submissions made by these three labs, the best system obtained 24.5 % speaker diarization error for the conference subdomain and 18.4 % for lecture subdomain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker segmentation and clustering in meetings

This paper describes the issue of automatic speaker segmentation and clustering for natural, multi-speaker meeting conversations. Two systems were developed and evaluated in the NIST RT-04S Meeting Recognition Evaluation, the Multiple Distant Microphone (MDM) system and the Individual Headset Microphone (IHM) system. The MDM system achieved a speaker diarization performance of 28.17%. This syst...

متن کامل

The LIA RT'07 Speaker Diarization System

This paper presents the LIA submission to the speaker diarization task of the 2007 NIST Rich Transcription (RT’07) evaluation campaign. We report a system optimised for conference meeting recordings and experiments on all three RT’07 subdomains and microphone conditions. Results show that, despite state-of-the-art performance for the single distant microphone (SDM) condition, in its current for...

متن کامل

The LIA-EURECOM RT‘09 Speaker Diarization System

This paper presents LIA-EURECOM’s joint submission to the NIST Rich Transcription 2009 (RT‘09) speaker diarization evaluation. We describe a number of modifications to our previous system which involve beamforming for the multiple distant microphone (MDM) condition and also significant enhancements to the speaker segmentation stage of the core speaker diarization system. These modifications lea...

متن کامل

Multi-stage Speaker Diarization for Conference and Lecture Meetings

The LIMSI RT-07S speaker diarization system for the conference and lecture meetings is presented in this paper. This system builds upon the RT06S diarization system designed for lecture data. The baseline system combines agglomerative clustering based on Bayesian information criterion (BIC) with a second clustering using state-of-the-art speaker identification (SID) techniques. Since the baseli...

متن کامل

Robust speaker diarization for meetings: ICSI RT06s evaluation system

In this paper we present the ICSI speaker diarization system submitted for the NIST Rich Transcription evaluation (RT06s) [1] conducted on the meetings environment. This is a set of yearly evaluations which in the last two years have included speaker diarization of two kinds of distinct meetings: conference room and lecture room. The system presented focuses on being robust to changes in the me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005